Mitigating the Cost, Performance, and Power Overheads Induced by Load Variations in Multicore Cloud Servers
نویسندگان
چکیده
Hong, Yu-Ju. Ph.D., Purdue University, December 2013. Mitigating the cost, performance, and power overheads induced by load variations in multicore cloud servers. Major Professor: Mithuna Thottethodi. Load variations whether in space or time pose a significant challenge to system designers. These load variations may induce inefficiencies such as load imbalance and overprovisioning, resulting in performance/power/cost overheads. The goal of my research is to mitigate such variation-induced overheads in multicore cloud servers. First, I focus on power/performance overheads in on-chip networks of a multicore chip. We design an on-chip network that is robust in both performance and energy across applications for timeand space-varying loads. Existing flow control mechanisms that perform well at high (low) loads suffer power and/or energy overheads at low (high) loads. In contrast, our design dynamically adapts flow control to achieve power and performance of the better-suited flow-control mechanism at all loads. Second, I target cost overheads resulting from time-varying loads for applications hosted in an Infrastructure-as-a-Service (IaaS) cloud. While IaaS clouds may enable significant cost-savings by allowing elastic provisioning, the uncertainty of time-varying loads impose additional cost to maintain quality of service. I demonstrate that, with some knowledge of the statistical properties of time-varying load, one can maximize cost-savings while satisfying response-time targets. Finally, I propose to mitigate the impact of data popularity variations in cloud servers. Sharding is a common technique to partition data among scale-out servers. Unfortunately, skewed popularity of data-elements can cause significant load imbalance among shard servers, leading to response time degradation. I design an augmented variant of a well-
منابع مشابه
A Clustering Approach to Scientific Workflow Scheduling on the Cloud with Deadline and Cost Constraints
One of the main features of High Throughput Computing systems is the availability of high power processing resources. Cloud Computing systems can offer these features through concepts like Pay-Per-Use and Quality of Service (QoS) over the Internet. Many applications in Cloud computing are represented by workflows. Quality of Service is one of the most important challenges in the context of sche...
متن کاملIntegrated modeling and solving the resource allocation problem and task scheduling in the cloud computing environment
Cloud computing is considered to be a new service provider technology for users and businesses. However, the cloud environment is facing a number of challenges. Resource allocation in a way that is optimum for users and cloud providers is difficult because of lack of data sharing between them. On the other hand, job scheduling is a basic issue and at the same time a big challenge in reaching hi...
متن کاملEnergy Aware Resource Management of Cloud Data Centers
Cloud Computing, the long-held dream of computing as a utility, has the potential to transform a large part of the IT industry, making software even more attractive as a service and shaping the way IT hardware is designed and purchased. Virtualization technology forms a key concept for new cloud computing architectures. The data centers are used to provide cloud services burdening a significant...
متن کاملLoad Balancing Approaches for Web Servers: A Survey of Recent Trends
Numerous works has been done for load balancing of web servers in grid environment. Reason behinds popularity of grid environment is to allow accessing distributed resources which are located at remote locations. For effective utilization, load must be balanced among all resources. Importance of load balancing is discussed by distinguishing the system between without load balancing and with loa...
متن کاملAn Effective Task Scheduling Framework for Cloud Computing using NSGA-II
Cloud computing is a model for convenient on-demand user’s access to changeable and configurable computing resources such as networks, servers, storage, applications, and services with minimal management of resources and service provider interaction. Task scheduling is regarded as a fundamental issue in cloud computing which aims at distributing the load on the different resources of a distribu...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2015